272 research outputs found

    Error, reproducibility and sensitivity : a pipeline for data processing of Agilent oligonucleotide expression arrays

    Get PDF
    Background Expression microarrays are increasingly used to obtain large scale transcriptomic information on a wide range of biological samples. Nevertheless, there is still much debate on the best ways to process data, to design experiments and analyse the output. Furthermore, many of the more sophisticated mathematical approaches to data analysis in the literature remain inaccessible to much of the biological research community. In this study we examine ways of extracting and analysing a large data set obtained using the Agilent long oligonucleotide transcriptomics platform, applied to a set of human macrophage and dendritic cell samples. Results We describe and validate a series of data extraction, transformation and normalisation steps which are implemented via a new R function. Analysis of replicate normalised reference data demonstrate that intrarray variability is small (only around 2% of the mean log signal), while interarray variability from replicate array measurements has a standard deviation (SD) of around 0.5 log2 units ( 6% of mean). The common practise of working with ratios of Cy5/Cy3 signal offers little further improvement in terms of reducing error. Comparison to expression data obtained using Arabidopsis samples demonstrates that the large number of genes in each sample showing a low level of transcription reflect the real complexity of the cellular transcriptome. Multidimensional scaling is used to show that the processed data identifies an underlying structure which reflect some of the key biological variables which define the data set. This structure is robust, allowing reliable comparison of samples collected over a number of years and collected by a variety of operators. Conclusions This study outlines a robust and easily implemented pipeline for extracting, transforming normalising and visualising transcriptomic array data from Agilent expression platform. The analysis is used to obtain quantitative estimates of the SD arising from experimental (non biological) intra- and interarray variability, and for a lower threshold for determining whether an individual gene is expressed. The study provides a reliable basis for further more extensive studies of the systems biology of eukaryotic cells

    A critical evaluation of methods for the reconstruction of tissue-specific models

    Get PDF
    Under the framework of constraint based modeling, genome-scale metabolic models (GSMMs) have been used for several tasks, such as metabolic engineering and phenotype prediction. More recently, their application in health related research has spanned drug discovery, biomarker identification and host-pathogen interactions, targeting diseases such as cancer, Alzheimer, obesity or diabetes. In the last years, the development of novel techniques for genome sequencing and other high-throughput methods, together with advances in Bioinformatics, allowed the reconstruction of GSMMs for human cells. Considering the diversity of cell types and tissues present in the human body, it is imperative to develop tissue-specific metabolic models. Methods to automatically generate these models, based on generic human metabolic models and a plethora of omics data, have been proposed. However, their results have not yet been adequately and critically evaluated and compared. This work presents a survey of the most important tissue or cell type specific metabolic model reconstruction methods, which use literature, transcriptomics, proteomics and metabolomics data, together with a global template model. As a case study, we analyzed the consistency between several omics data sources and reconstructed distinct metabolic models of hepatocytes using different methods and data sources as inputs. The results show that omics data sources have a poor overlapping and, in some cases, are even contradictory. Additionally, the hepatocyte metabolic models generated are in many cases not able to perform metabolic functions known to be present in the liver tissue. We conclude that reliable methods for a priori omics data integration are required to support the reconstruction of complex models of human cells.Acknowledgments. S.C. thanks the FCT for the Ph.D. Grant SFRH/BD/ 80925/2011. The authors thank the FCT Strategic Project of UID/BIO/04469/2013 unit, the project RECI/BBB-EBI/0179/2012 (FCOMP-01-0124-FEDER-027462) and the project “BioInd - Biotechnology and Bioengineering for improved Industrial and Agro-Food processes”, REF. NORTE-07-0124-FEDER-000028 Co-funded by the Programa Operacional Regional do Norte (ON.2 - O Novo Norte), QREN, FEDER

    Post-Transcriptional Regulation of BCL2 mRNA by the RNA-Binding Protein ZFP36L1 in Malignant B Cells

    Get PDF
    The human ZFP36 zinc finger protein family consists of ZFP36, ZFP36L1, and ZFP36L2. These proteins regulate various cellular processes, including cell apoptosis, by binding to adenine uridine rich elements in the 3′ untranslated regions of sets of target mRNAs to promote their degradation. The pro-apoptotic and other functions of ZFP36 family members have been implicated in the pathogenesis of lymphoid malignancies. To identify candidate mRNAs that are targeted in the pro-apoptotic response by ZFP36L1, we reverse-engineered a gene regulatory network for all three ZFP36 family members using the ‘maximum information coefficient’ (MIC) for target gene inference on a large microarray gene expression dataset representing cells of diverse histological origin. Of the three inferred ZFP36L1 mRNA targets that were identified, we focussed on experimental validation of mRNA for the pro-survival protein, BCL2, as a target for ZFP36L1. RNA electrophoretic mobility shift assay experiments revealed that ZFP36L1 interacted with the BCL2 adenine uridine rich element. In murine BCL1 leukemia cells stably transduced with a ZFP36L1 ShRNA lentiviral construct, BCL2 mRNA degradation was significantly delayed compared to control lentiviral expressing cells and ZFP36L1 knockdown in different cell types (BCL1, ACHN, Ramos), resulted in increased levels of BCL2 mRNA levels compared to control cells. 3′ untranslated region luciferase reporter assays in HEK293T cells showed that wild type but not zinc finger mutant ZFP36L1 protein was able to downregulate a BCL2 construct containing the BCL2 adenine uridine rich element and removal of the adenine uridine rich core from the BCL2 3′ untranslated region in the reporter construct significantly reduced the ability of ZFP36L1 to mediate this effect. Taken together, our data are consistent with ZFP36L1 interacting with and mediating degradation of BCL2 mRNA as an important target through which ZFP36L1 mediates its pro-apoptotic effects in malignant B-cells

    Using ILP to Identify Pathway Activation Patterns in Systems Biology

    Get PDF
    We show a logical aggregation method that, combined with propositionalization methods, can construct novel structured biological features from gene expression data. We do this to gain understanding of pathway mechanisms, for instance, those associated with a particular disease. We illustrate this method on the task of distinguishing between two types of lung cancer; Squamous Cell Carcinoma (SCC) and Adenocarcinoma (AC). We identify pathway activation patterns in pathways previously implicated in the development of cancers. Our method identified a model with comparable predictive performance to the winning algorithm of a recent challenge, while providing biologically relevant explanations that may be useful to a biologist

    Pathogen- and Host-Directed Antileishmanial Effects Mediated by Polyhexanide (PHMB)

    Get PDF
    BACKGROUND:Cutaneous leishmaniasis (CL) is a neglected tropical disease caused by protozoan parasites of the genus Leishmania. CL causes enormous suffering in many countries worldwide. There is no licensed vaccine against CL, and the chemotherapy options show limited efficacy and high toxicity. Localization of the parasites inside host cells is a barrier to most standard chemo- and immune-based interventions. Hence, novel drugs, which are safe, effective and readily accessible to third-world countries and/or drug delivery technologies for effective CL treatments are desperately needed. METHODOLOGY/PRINCIPAL FINDINGS:Here we evaluated the antileishmanial properties and delivery potential of polyhexamethylene biguanide (PHMB; polyhexanide), a widely used antimicrobial and wound antiseptic, in the Leishmania model. PHMB showed an inherent antileishmanial activity at submicromolar concentrations. Our data revealed that PHMB kills Leishmania major (L. major) via a dual mechanism involving disruption of membrane integrity and selective chromosome condensation and damage. PHMB's DNA binding and host cell entry properties were further exploited to improve the delivery and immunomodulatory activities of unmethylated cytosine-phosphate-guanine oligodeoxynucleotides (CpG ODN). PHMB spontaneously bound CpG ODN, forming stable nanopolyplexes that enhanced uptake of CpG ODN, potentiated antimicrobial killing and reduced host cell toxicity of PHMB. CONCLUSIONS:Given its low cost and long history of safe topical use, PHMB holds promise as a drug for CL therapy and delivery vehicle for nucleic acid immunomodulators

    A robust method for estimating gene expression states using Affymetrix microarray probe level data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Microarray technology is a high-throughput method for measuring the expression levels of thousand of genes simultaneously. The observed intensities combine a non-specific binding, which is a major disadvantage with microarray data. The Affymetrix GeneChip assigned a mismatch (MM) probe with the intention of measuring non-specific binding, but various opinions exist regarding usefulness of MM measures. It should be noted that not all observed intensities are associated with expressed genes and many of those are associated with unexpressed genes, of which measured values express mere noise due to non-specific binding, cross-hybridization, or stray signals. The implicit assumption that all genes are expressed leads to poor performance of microarray data analyses. We assume two functional states of a gene - expressed or unexpressed - and propose a robust method to estimate gene expression states using an order relationship between PM and MM measures.</p> <p>Results</p> <p>An indicator 'probability of a gene being expressed' was obtained using the number of probe pairs within a probe set where the PM measure exceeds the MM measure. We examined the validity of the proposed indicator using Human Genome U95 data sets provided by Affymetrix. The usefulness of 'probability of a gene being expressed' is illustrated through an exploration of candidate genes involved in neuroblastoma prognosis. We identified the candidate genes for which expression states differed (un-expressed or expressed) when compared between two outcomes. The validity of this result was subsequently confirmed by quantitative RT-PCR.</p> <p>Conclusion</p> <p>The proposed qualitative evaluation, 'probability of a gene being expressed', is a useful indicator for improving microarray data analysis. It is useful to reduce the number of false discoveries. Expression states - expressed or unexpressed - correspond to the most fundamental gene function 'On' and 'Off', which can lead to biologically meaningful results.</p

    The Pathway Coexpression Network: Revealing pathway relationships.

    Get PDF
    A goal of genomics is to understand the relationships between biological processes. Pathways contribute to functional interplay within biological processes through complex but poorly understood interactions. However, limited functional references for global pathway relationships exist. Pathways from databases such as KEGG and Reactome provide discrete annotations of biological processes. Their relationships are currently either inferred from gene set enrichment within specific experiments, or by simple overlap, linking pathway annotations that have genes in common. Here, we provide a unifying interpretation of functional interaction between pathways by systematically quantifying coexpression between 1,330 canonical pathways from the Molecular Signatures Database (MSigDB) to establish the Pathway Coexpression Network (PCxN). We estimated the correlation between canonical pathways valid in a broad context using a curated collection of 3,207 microarrays from 72 normal human tissues. PCxN accounts for shared genes between annotations to estimate significant correlations between pathways with related functions rather than with similar annotations. We demonstrate that PCxN provides novel insight into mechanisms of complex diseases using an Alzheimer's Disease (AD) case study. PCxN retrieved pathways significantly correlated with an expert curated AD gene list. These pathways have known associations with AD and were significantly enriched for genes independently associated with AD. As a further step, we show how PCxN complements the results of gene set enrichment methods by revealing relationships between enriched pathways, and by identifying additional highly correlated pathways. PCxN revealed that correlated pathways from an AD expression profiling study include functional clusters involved in cell adhesion and oxidative stress. PCxN provides expanded connections to pathways from the extracellular matrix. PCxN provides a powerful new framework for interrogation of global pathway relationships. Comprehensive exploration of PCxN can be performed at http://pcxn.org/
    corecore